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The  project  "Comprehension  of  Dhistrated  Text”  is  directed  toward  producing  an  understanding  of  die 
fadlitative  effects  of  pictures  tm  comprefaoision  of  texts.  Both  conif«efaaision  inocesses  and  die  lesu^  of  diose 
processes  (cognitive  rqxeseotaUns)  are  considered.  This  tedinical  report  is  divided  into  two  sections.  The  first 
pies^  a  summanr  of  work  ctmqikxed  during  the  previous  year  and  wnk  in  progress,  including  a  list  of 
pidilications  and  presentations,  llie  second  section  is  a  complete  report  of  experimoital  work  deigned  to 
mvestigate  one  comprehension  processes  affected  by  pictures. 

Section  I:  Summary  of  work  convicted  and  in  progress 

A.  W(Rk  Crmqileted 

1.  Pictures  facilitate  construction  of  mental  models.  Our  experimental  work  has  demonstrated  that  <»ie  way 
in  which  pictures  help  conqxdiension  is  by  providing  a  firamework  for  the  cmismiction  of  a  nuntal  model,  a 
representation  of  wdiat  the  text  is  about  (e.g.,  the  components  of  a  machine  or  the  steps  in  a  procedure)  as  oi^sed 
to  a  representation  of  the  text  itself  (words  and  s«itences).  This  woric  is  rqxnted  more  fiilly  in  die  19^  Annual 
Tedinicai  Report,  and  a  version  of  it  (doiberg  and  Langston)  is  in  press. 

2.  Pictures  and  anaphora.  An  inqxirtant  omqionait  of  text  conqinehaision  is  the  resolution  of  anaidioric 
relations.  That  is,  anaphora,  sudi  as  pronouns,  require  reference  to  previously  present^  information, 
atecedents,  to  uncover  their  contextually  oqilicit  mealing.  Anaphor  resolution  is  a  major  conqxinent  in 
providing  textual  coherence.  We  hypod^i^  tha  pictures  would  facilitate  amqihmr  resolution  by  providing  an 
external  mem^  for  potential  antecedents.  Thus  a  glance  a  the  picture  would  provide  access  to  antecedents  tha 
would  odierwise  have  to  be  uncovered  through  a  more  laxirious  seardi  of  the  tea  w  mesnny.  This  hypothesis 
received  little  onpirical  support  A  report  of  the  major  experiments  will  be  submitted  for  publication  by  Augua 
IS,  1991,  and  the  experimaits  are  described  in  detail  in  Section  n  of  this  tedinical  rqxxt 

B.  Work  in  progress 

1.  Work  on  the  "noticing  hypothesis.”  We  have  evidaice  (Glenbetg  and  Langston)  tha  pictures  he4) 
readers  to  fcxm  mental  models.  But  why  should  this  hel^  conqirdiension?  One  hyp^esis  is  tha  the  menta 
models  allow  readers  to  "notice"  significaa  relations  in  die  model  of  the  situation  t^  are  not  explicit  in  die  text 
We  have  conducted  several  eiqieriments  testii^  dus  hypothesis,  but  we  ae  not  satisfied  with  any  of  them. 

Internal  andyses  of  the  results  reveal  dia  subjects  in  foe  experimaits  have  found  ways  of  doing  dia  task  tha 
preclude  noticing. 

2.  External  memory  hypt^esis.  We  have  conducted  experimaits  testii^  foe  hypothesis  tha  pictures  act  as 
a  form  of  external  memc^  vfoidi  can  be  used  to  supplement  a  reader’s  woiki^  manmy.  So  fa,  we  have  been 
able  to  demonstrate  tha  pictures  do  not  enhance  gaasal  working  memory  capacity,  but  fo^  do  affect  foe 
COTiponeot  of  woridng  memory  idaitified  as  foe  visual-spatial  sketchpad. 

3.  Pictures  and  men^  for  wha  people  ssy.  Professor  Thomas  Grimes  (now  a  Kansas  State  University; 
and  I  have  been  collaborating  on  a  project  to  determine  if  seeing  a  picture  of  a  speaker  he^s  one  to  remember  vfo 
the  ^leaker  has  said.  We  have  con^<^  experiments  tha  danonstrate  a)  pictures  do  focilitate  manory  in  this 
w^,  and  b)  foe  effect  dqiends  on  using  actiuti  pictures  of  speakers;  it  is  not  found  whai  geometric  symbols 
r^l^  the  faces.  A  curratt  experiment  is  investigating  two  explanaory  hypotheses.  The  first  is  foa  having  foe 
picture  allows  the  subject  to  aeate  a  more  concrete  (and  manorable)  r^esaitation  of  foe  information.  The 
second  is  tha  subjects  use  foe  picture  to  generae  a  stable  personality  profile  of  foe  speaka  and  tha  foe  speaker’s 
statanaits  ae  int^rated  into  t^  profile. 

4.  Conqnita  simulation  My  gradu^  students  and  I  have  developed  a  computa  simulation  of  foe  major 
processes  tha  we  believe  occur  when  reading  a  text  accompanied  by  a  picture.  The  simulation  derives 
propositions  from  sinqilified  texts  and  uses  foose  propositions  to  build  mental  models  of  foe  situation  described 
by  foe  text  Based  on  relations  within  foe  mental  modd,  new  propositions  ae  inferred.  Pictures  are  used  in 
several  wttys.  First,  pictures  disambiguate  relations  described  by  foe  text  For  exanq)le,  if  foe  t«t  staes  foa  "A 
is  nea  B,”  it  is  not  clea  if  A  is  to  foe  left  of  B,  foe  right  of  B,  unda  B,  etc.  Second,  pictures  provide  metric 
information  about  locations.  Third,  pictures  reduce  foe  amount  of  cognitive  processing  requir^  to  maintain 
symbols  in  foe  mental  model  when  foose  symbols  ccrrespond  to  objects  in  foe  picture.  Our  initial  effmts  whh  ttu 
opnqnita  modd  has  produced  a  gratifying  d^ree  of  success  in  simulations  of  text  comprehension  experiments, 
memory  acperiments,  and  map  learning  experimaits. 
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C.  Publicatioas  and  presentations. 

Glenbetg,  A.  M.  Building  mental  models  from  text  A  simulation  of  comprehension  and  memory.  Invited 
address,  ARMADILLO-n  conference,  Texas  A&M  University,  May,  1991. 

Glenb^,  A.  M.  Comprension  del  texto  illustrado:  los  dibujos  ayudan  a  crear  modelos  mentales.  Invited 
address.  University  of  Salamanca,  July,  1991. 

Glenbetg,  A.  M.  Buildup  mental  models  from  text  A  simulation  of  congvdioision  and  memory.  Invited 
address.  Human  Conununication  Researdi  Centre,  University  of  Edinburgh,  July  1991. 

Glenberg,  A.  M.,  and  Langston,  W.  E.  (in  press).  Con^irehension  of  illustrated  text  Pictures  help  to  build 
mental  models.  Journal  ofManory  and  Language. 

Gloiberg,  A.  M.,  and  Langston,  W.  E  (1991,  May).  Conqnehension  of  illustrated  text:  Pictures  help  to 
build  mental  models.  In  A.  M.  Glenberg  and  M.  McDaniel  (Co-^airs),  Mental  models,  pictures,  and  texx: 
integration  of  spatial  and  verbal  Ufbrmation.  Symposium  cmutucted  at  die  meeting  of  die  hridwe^era 
Psydiological  Associatioa 

Glenberg,  A.  M.,  and  Kruley,  P.  (manuscript  submitted  for  publication).  Pictures  and  amqdiora:  Evidence 
for  indepoidait  processes.  Memory  A  Cogration 


AFOSR-89-0367 


Page3‘ 

Section  0;  Pictures  and  anaphora:  Evidence  fcM*  independettt  processes 

Abstract 

Pictures  enhance  comprehension  of  written  texts,  but  die  perceptual  and  cognitive  processes  that  underlie 
this  effect  have  not  been  idoitified.  Because  int^radng  the  infmination  contained  in  a  text  places  demands  on 
woridiig  memmy,  the  effect  of  a  picture  may  be  to  expand  die  functional  capacity  of  woiidng  memory,  and 
diereby  facilitate  comprehension.  Based  on  this  reasoning,  we  predicted  that  the  availability  of  a  diagram  would 
interact  with  die  difficulty  of  resolving  ant^dior  references  in  texts.  Resolution  of  an  anaphor  distant  from  its 
antece^nt  (whidi  should  stress  working  memory)  should  benefit  greatly  horn  a  picture,  whereas  resolution  of  an 
anaphor  near  to  its  antecedent  should  benefit  less  from  a  picture.  In  experiments  involving  both  cumulative  and 
moving  window  presentations  of  texts,  picture  availability  and  distance  separating  antecedent  from  anchor  were 
manipulated  ccmcurroitty.  Ahhougfa  bodi  picture  presence  and  ease  of  anther  resolution  significantly  improved 
subjects'  conp-ehension  of  die  material,  no  evidence  was  found  ftx*  an  interaction  of  these  factors.  The  results 
are  interpreted  as  consistent  widi  either  dual  code  theory  or  aspects  of  waking  monory  managonent  diat  do  not 
invtdve  aru^or  resolution. 
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It  is  a  ftct  ttiat  pictures  he^  pec^le  tt)  learn  from  texts  (see  Willows  and  Houghton,  1987,  for  reviews  of  the 
Ikerature).  What  is  less  certain,  is  exactly  how  pictures  have  diis  salutary  effect  The  experiments  reported  here 
were  designed  to  investigate  a  possible  b«ieficial  effect  of  picture  on  an  inportant  process  of  text 
conqrrdiension,  aiuqdKH*  resolutioa  To  fiveshadow  our  results,  we  found,  once  again,  that  pictures  enhance 
comprefaaision.  Nonetfadess,  we  were  not  able  to  adduce  any  evidence  that  the  enhancement  was  due  to  a  direct 
effect  of  pictures  on  aiuqriior  resohitioiL  We  discuss  several  ways  in  which  pictures  can  aid  comprdiension 
widiout  affecting  amqrhor  resolution. 

Our  tact  is  to  investigate  die  general  hypothesis  that  pictures  enhance  woridng  memory  management  Most 
theories  of  text  ctmqirdiension  (e.g.,  Kintsch,  1988)  specify  diat  an  important  aspect  of  comprehension  is  die 
building  of  a  represaitation  (see  also  Gemsbacher,  Varner,  and  Faust  1990),  and  diat  the  rqiresentation  is  built 
using  the  facilities  of  working  memory.  Because  working  memory  is  diought  to  be  severely  limited  in  the  amount 
of  informadon  that  can  be  rqiresented  at  one  time,  the  comprehender  may  find  it  necessary  to  shunt  information 
into  and  out  of  working  memory  (diat  is,  to  manage  it)  while  building  the  appropriate  structures.  Fletcher  and 
Bloom  (1988),  for  example,  have  investigated  various  strategies  for  controlling  the  contents  of  working  memcny 
during  reading.  Picturesmay  facilitate  working  memory  management  in  several  ways.  For  example,  pictures 
can  serve  as  an  external  memory,  so  that  die  reader  can  avoid  searches  of  long-term  memory  or  the  text  itself, 
when  information  is  required.  Larkin  and  Simon  (1987)  demonstrated  how  pictures  may  enhance  a  search  of  this 
sort  Glenberg  and  Langston  (in  press)  demonstrated  that  pictures  can  lead  to  a  reorganization  of  information  in 
working  memory.  In  that  experiment  subjects  read  texts  describing  four-step  procedures.  The  texts  described 
both  die  content  of  the  steps  and  die  order  in  ushich  the  steps  were  to  be  performed  When  reading  without  a 
picture,  subjects  tended  to  represent  the  stqis  in  die  order  in  whidi  diey  were  presented  in  the  text  When  reading 
the  same  texts  accompanied  by  pictures  illustrating  the  order  of  the  steps  when  the  procedure  is  executed,  subjects 
t^ed  to  represent  the  order  of  the  st^s  as  executed,  rather  than  as  described  in  the  text 

Anaphora  resolution  is  a  conqirdiension  process  that  is  sensitive  to  wtxking  memory  management  An 
anchor  is  a  word  or  phrase  diat  is  interpreted  in  relation  to  previous  elements  of  the  discourse  (the  antecedent  or 
referent  of  die  anaphor).  Pronouns  are  prototypical  anaphon,  in  that  the  interpretation  of  a  pronoun  (e.g.,  the 
word  "it")  often  requires  reference  to  a  previous  description  in  the  text  Nonetheless,  anaf^ors  may  be  noun 
phrases,  synonyms  of  previous  nouns,  temporal  referents  such  as  "the  former"  (Jakimik  and  Glenberg,  1990),  or 
even  the  absence  of  a  noun  (see  Halliday  and  Hasan,  1976).  Successful  anaphor  resolution  is  probably  an 
important  component  in  generating  a  sense  of  coherence,  that  the  sentences  in  a  text  belong  tog^er. 
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The  process  of  anaphor  resolution  is  sensitive  to  distance  between  the  anaphor  and  its  antecedent  It  takes 
kmger  to  acconq)lish  amg^ior  resolution  when  the  anaphor  and  its  antecedoit  are  far  from  one  anther  in  the  text 
than  when  they  are  close  together  (e.g,  in  adjacent  sentences).  Dononstrations  of  this  distance  effect  may  be 
found  in  Claric  and  Sengul  (1979),  Daneman  and  Carpenter  (1980),  and  Ehrlich  and  Rayno*  (1983). 

A  standard  interpretation  of  this  distance  effect  is  that  it  reflects  a  type  of  wmicing  monory  managonenL 
Antecedents  that  are  near  to  their  anaphors  are  likely  to  be  represented  in  working  memory  (or  foregrounded) 
whoi  the  anaplKM*  is  read.  When  the  anteced^  is  far  from  the  am^^r,  the  antecedent  may  not  be  in  working 
memory,  and  it  must  be  reinstated  to  resolve  the  anaphor.  Ihis  reinstatemem  process  takes  time  and  produces  the 
distance  effect 

We  anticipated  that  pictures  would  facilitate  this  reinstatonent  process  and  therdry  provide  fn-  smooth  and 
effective  conqrrehensioiL  Our  reasoning  was  based  on  the  idea  drat  a  picture  may  serve  as  an  external  moiKxy 
whid)  is  easier  to  search  than  either  the  physical  text  or  die  representation  of  the  text  in  long-term  memory  (Laiidn 
and  Simon,  1987).  This  reasoning  predicts  an  interaction  betwem  distance  and  vdiether  or  not  a  picture 
accompanies  the  text.  When  the  antecedent  and  the  anaphor  are  close  in  the  text,  the  antecedent  should  be 
represented  in  woridng  memory,  so  that  the  picture  makes  litde  difference.  However,  when  the  antecedent  and  the 
anaphor  are  far  apart,  the  antecedent  needs  to  be  reinstated  in  working  memory,  and  die  picture  should  facilitate 
rdnstatonent 

Although  we  anticipated  this  interaction,  our  eiqieriments  were  designed  to  discriminate  among  several 
hypotheses.  First,  it  m^  be  that  pictures  need  to  be  perceptually  present  to  facilitate  reinstatement  of  antecedents 
(perceptual  hypothesis).  Second,  it  may  be  diat  exposures  to  pictures  creates  a  memory  rqiresentation  (of  the 
picture)  that  can  be  used  to  facilitate  reinstatement  in  the  absence  of  the  i^ysical  picture  (memcny  hypothesis). 
Third,  contrary  to  our  apectations,  pictures  may  not  affect  anrgihor  resolution,  but  may  enhance  comprdiension 
through  odier  means  (general  facilitation  hypothesis). 

Subjects  read  texts  containing  a  short  scientific  description  of  a  single  object,  typically  a  living  organism,  a 
part  of  an  organism,  or  a  mechanical  device  (some  examples:  the  sphagnum  sporophyte,  the  structure  of  a  leaf, 
and  a  solar  collector).  Either  four  or  five  distinct  parts  of  the  objea  were  described  in  the  text;  antecedent 
sentences  described  names  and  locations  of  the  parts,  and  amgdior  sentences  rqieated  the  ^atial  location  (but  not 
the  name)  and  jn-esented  one  fact  about  the  part.  Each  text  used  in  die  study  conformed  to  the  structure  shown  in 
Table  1.  This  structure  includes  two  anaphoric  references,  one  to  a  referent  mentioned  in  die  immediately 
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Table  1 

Sample  Text  Used  in  the  Study 


Segment  Name 
Introduction: 

Antecedent  1: 

Filkn: 

Antecedent  2: 

Near  Anaphor: 
Cc^erence  Sentence  1: 
Far  An^or 
Coherence  Sentence  2; 


The  sphagnum  sporophyte  is  the  rqmxluctive  syst^  of  a  fungus  that  grows  on 
plants. 

The  operculum  is  at  the  top  of  the  sf^agnum  spox^hyte. 

The  fungus's  spores  are  contained  in  an  arc-shig>ed  central  column  surrounded  by  a 
sterile  r^on  called  the  columella. 

The  pseudopodium  is  on  the  bottom  of  die  fungus. 

The  part  on  the  bottom  elevates  die  entire  sporoidiyte. 

Elevation  increases  the  distribudmi  of  spores  when  diey  are  released. 

The  part  at  the  top  maintains  internal  pressure 

This  internal  pressure  is  v^hat  will  eventually  cause  the  sphagnum  sporoidiyte  to 
pop  and  release  the  spores. 


Near  Question: 
Far  Question: 
Filler  Question: 


What  does  the  pseudopodium  do?  a.  releases  spores  b.  elevates  die  sporophyte  c. 
nourishes  the  sporophyte  d.  creates  pressure.  (Answer:  b.) 

What  is  the  fiinction  of  die  operculum?  a.  to  elevate  the  sporofdiyte  b.  to  distribute 
spores  c.  to  maintain  internal  pressure  d.  to  aid  in  attachment  (Answo-:  c.) 
Which  part  surrounds  the  central  diambei?  a.  foot  b.  columella  c.  psoidopodium 
d.  operculum  (Answa*:  b.) 
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preceding  sentmce  (”Near  Anaphor",  which  refers  to  "Antecedent  2"),  and  the  other  to  a  referent  mentioned  three 
or  more  sentences  previously  ("Far  Am^^",  referring  to  "Antecedoit  1 ").  In  both  cases,  die  anaphor  was  a 
definite  noun  fdirase  describing  a  spatial  locsttion,  such  as,  "the  part  on  the  bottom."  Antecedent  and  am^r 
s^ments  were  aU  short  declarative  sentoices  in  which  antecedent  and  anaphor  phrases  occurred  at  the  beginning 
of  the  sentmce.  The  role  of  the  Fillo*  and  Coherence  Sentraice  s^moits  was  twofold:  first,  to  increase  the 
distance  in  die  text  between  the  far  anafdior  and  its  antecedem,  and  second,  to  improve  die  coherence  of  the  texts. 
To  cordrol  for  effects  of  specific  sentences  on  reading  time  and  conqirehension,  the  texts  were  designed  so  that 
either  critical  sentence  could  serve  as  Near  or  Far  Anaphor. 

In  addition,  texts  woe  sometimes  accompanied  by  pictures.  Figure  1  is  die  picture  that  could  accompany 
die  text  in  Table  1 .  Pictures  showed  the  structural  relationships  betweoi  parts  of  the  object  in  a  level  of  detail 
typical  of  a  high-school  or  undergraduate  textbook.  For  many  of  the  objects,  the  picture  suggested  a  diree- 
dimensional  representatioiL  Each  part  of  the  (4)ject  named  in  the  text  was  labeled  in  die  picture. 

On  one-third  of  the  trials,  a  picture  acconqianied  die  t»t  (full  picture  condition).  On  one-third  of  the  trials,  a 
picture  accompanied  the  text,  but  it  disappeared  before  the  near  anchor  soitence  was  di^layed  (dis£Q^)earing 
picture  condition).  For  the  remaining  texts,  no  picture  was  shown  (no  picture  condition).  Consider  the 
predictions  from  the  three  hypodieses  outlined  previously.  First,  suppose  that  the  picture  must  be  perceptually 
available  to  facilitate  anai^or  resolution.  In  this  case,  distance  and  picture  condition  should  interact  That  is,  a 
distance  effect  should  be  found  in  the  no  picture  condition,  but  diis  distance  effect  should  be  substantial^  reduced 
in  the  full  picture  condition  The  disappearing  picture  condition  should  parallel  the  no  picture  condition,  because 
the  hypothesis  supposes  diat  the  picture  must  be  percqitualty  present  to  facilitate  reinstatement  of  distant 
artecedents.  Second,  suppose  that  pictures  result  in  a  mental  representation  that  can  be  used  to  facilitate 
rdnstatemoit  of  antecedents  (memoiy  hypothesis).  In  tiiis  case,  there  should  again  be  an  interaction  between 
distance  and  picture  condition,  but  the  disappearing  picture  condition  should  parallel  the  full  picture  condition  (die 
distance  effect  should  be  reduced  in  both  cases).  Finally,  consider  die  possibility  that  pictures  oihance 
comprehension,  but  not  anaphor  resolution  (general  facilitation  hypothesis).  In  this  case,  pictures  will  produce  a 
main  effect  (better  performance  dian  in  the  no  picture  condition),  but  diCTe  should  not  be  an  interaction  with 
distance. 


We  used  two  dependent  variables.  One  was  reading  time.  Subjects  pressed  a  key  to  expose  successive 
pans  of  the  teit,  and  the  time  between  key  presses  was  recorded.  In  this  case,  an  interaction  between  distance  and 
picture  condition  would  be  revealed  by  a)  an  increase  in  reading  time  with  distance  in  the  no  picture  condition,  and 
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Figure  1  The  diagram  presented  with  the  text  of  Table  I 
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b)  a  smaller  increase  (ex’  no  increase)  with  distance  in  the  picture  conditions.  Unfortunately,  this  prediction  may 
be  compromised  by  die  mechanics  of  the  reading  situation.  That  is,  subjects  must  idiysically  move  dieir  eyes 
fixim  the  text  to  the  picture,  and  this  will  take  some  time.  Thus  if  subjects  only  refer  to  die  picture  in  the  far 
condition,  the  eye  movement  time  will  be  added  (»ily  in  this  amdition,  working  against  the  predicted  interv  <  ^n. 
Also,  having  die  picture  present  may  induce  other  strat^ies,  such  as  using  the  text  to  build  a  cognitive  model  of 
the  object,  and  using  the  picture  as  a  check  for  accuracy.  Thus  the  picture  may  slow  down  reading,  radio*  dian 
speeding  it  Ftx*  diese  reasons,  we  included  performance  on  con^nehension  questions  as  a  second  dependent 
variable. 

The  "near"  comprdioision  question  required  irttegration  of  information  presented  in  the  sentences 
containing  the  near  antecedent  and  the  near  anaphor.  Thrk  is,  the  Antecedent  2  sentoice  provided  the  name  of  a 
part  (e.g.,  "die  psaidopodium")  and  its  location  (e.g.,  "on  the  bottom").  The  Near  Ana|dior  sentence  referred  to 
the  part  using  die  spatial  location  (e.g.,  "die  part  on  die  bottom”)  and  desoibed  the  function  of  die  part  (e.g., 
"devates  the  entire  sporophyte").  The  comprdiension  tpiestion  required  die  association  of  the  name  (horn  the 
antecedent  sentence)  and  the  function  (from  the  aruqihor  sentence).  Similarly,  the  "far"  conqxefaension  question 
required  int^ration  of  information  presented  in  the  Antecedent  1  sentence  and  die  Far  Anaphor  sentence.  When 
performance  on  the  comprehension  questions  is  the  dependertt  variable,  and  interaction  between  distance  and 
picture  condition  would  be  revealed  by  a)  a  decrease  in  percent  correct  with  distaiKe  in  the  no  picture  condition, 
and  b)  a  smallo*  decrease  (or  no  deaease)  in  die  picture  condkions. 

Before  subjects  read  the  texts,  we  measured  their  reading  spans.  Daneman  and  Carpenter  (1980) 
dononstrated  that  reading  span  correlated  with  success  in  resolving  distant  referents  of  pronouns.  Supposedly, 
readers  with  large  reading  spans  can  more  easily  maintain  in  working  memory  potential  referents  than  can  readers 
with  smaUer  spans.  On  tlK  assunqition  diat  pictures  facilitate  working  memory  management,  we  expected  the 
following  pattern  of  ctxrelations.  Memory  span  should  correlate  with  performance  in  the  far  condition  when  diere 
are  no  pictures  and  readers  are  forced  to  rely  on  their  own  memories.  The  correlation  should  be  substantially 
reduced,  however,  whoi  pictures  are  present  and  readers  can  use  die  pictures  to  supplement  their  own  memories. 
Unfortunately,  the  correlations  with  reading  span  were  all  low  and  unsystematic  across  the  experiments.  Thus  we 
will  not  discuss  the  reading  ^an  measure  further  »cept  to  indicate  its  presence  in  the  procedure. 

Method 

Subjects.  The  48  subjects  included  both  paid  participants  and  students  enrolled  in  introductory 
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psydiok^  omirses  at  the  University  of  Wisconsin-Madison.  The  latter  received  extra  credit  for  participating.  All 
of  the  subjects  were  paid  a  cash  bonus  based  on  the  total  nund>er  of  correct  responses  on  the  comprehension 
questions.  Six  addhkxial  subjects  were  replaced,  five  because  computer  failures  made  h  impossible  to  analyze 
didr  data,  and  one  for  fiuling  to  exceed  chance  performance  on  the  comprehensions  questions. 

1IT«1  dgsiyn-  A  total  of  35  texts  (3  practice,  32  experimental)  were  used,  eadi  with  a  corresponding 
labeled  diagram  (as  shown  in  Figure  1).  Comprehension  questions,  which  tested  for  knowledge  of  bodi  spatial 
and  factual  information,  were  designed  to  motivate  subjects  to  read  all  sentoices  carefully;  as  a  further  incentive 
subjects  were  informed  in  advance  of  the  paymoit  of  10  cents  for  each  correct  answer.  Three  conqxehoision 
questions  in  multiple  choice  format,  each  with  four  possible  answers,  were  prepared  for  eadt  text  (see  Table  1). 
Correctly  answering  the  "Near  (Question"  required  subjects  to  in^rate  information  found  in  the  Antecedent  2  and 
Near  Anaphor  sentences.  Similarly,  a  "Far  (^estion"  required  int^rating  Anteced^  1  and  the  Far  Anaphor. 
"Filler  Oiestions"  were  included  to  prevent  subjects  from  identifying  specific  s^ments  of  the  text  upon  which  to 
focus  attentioiL 

Texts  and  pictures  were  displayed  side-by-side  on  a  Macinrosh  color  monitor.  Upon  responding  to  a 
prompt,  the  subject  was  shown  the  diagram  and  die  title  of  the  text  Each  subsequent  key  [x-ess  revealed  the  next 
sector  (see  below)  of  the  text  In  the  No  Picture  condition,  die  portion  of  the  screen  allocated  for  the  picture 
remained  blank,  and  only  the  text  was  shown;  in  the  Disappearing  Picture  condition,  the  diagram  disai^ieared  after 
the  key  press  that  revealed  the  Near  Anajdior  sentence  (as  idendfied  in  Table  1);  in  the  Full  Picture  condition  both 
picture  and  text  remained  visible  until  the  key  press  following  die  final  word  in  the  text  Thus  the  difference 
between  Disappearing  and  Full  Picture  conditions  was  the  status  of  the  picture  while  reading  the  critical  anaphor 
sentences. 

Eight  of  the  32  experimental  texts  were  used  as  unsoored  distractor  texts.  These  texts,  which  were  inteixied 
as  a  further  guard  against  subject  strat^izing,  included  some  comprehoision  questions  that  did  not  follow  the 
Near/Far/Filler  schone.  In  addition,  for  these  texts,  the  picture  disappeared  after  the  presentation  of  a  randomly 
sdected  wtxxl,  rather  than  after  the  end  of  Antecedent  2.  The  same  eight  texts  woe  used  as  distractors  for  all 
subjects. 

Two  factors  were  manipulated  within  subjects.  One  factor  was  picture  presence:  each  subject  read  8  texts 
in  each  of  the  three  conditions  (No  Picture,  Disappearing  Picture,  and  Full  Picture).  The  second  factor  was 
anaphtn*  distance,  with  two  levels:  near  and  far.  Each  text  contained  a  near  and  far  anaphor,  each  with  a  related 


AFOSR-89-0367 


^imuai  Technical  Report  Page  1 1 

71/90  through  6/30/91 

comprdiension  question.  Texts  were  presented  in  a  different  random  order  for  each  subject,  with  each  text 
qjpearing  equally  oftoi  in  the  three  picture  conditions.  In  addition,  each  text  was  seen  equally  often  in  its 
standard  order  and  with  Near  and  Far  Anaphor  sentences  (and  corresponding  antecedent  and  coherence  sentences) 
interchanged. 

Procedure.  Subjects  were  instructed  to  aim  for  comprdiension.  They  were  also  instructed  that  pictures 
mi^t  disa^iear,  "as  if  they  had  turned  the  page  of  a  book. "  After  the  display  of  the  tide  and  diagram,  texts  were 
presented  one  word  at  a  time  in  a  cumulative  displ^:  the  text  was  initially  shown  widi  dashes  representing  the 
letters  in  each  word.  Readers  controlled  the  rate  of  presentation  by  pressing  the  space  bar  when  they  were  ready 
for  a  new  word;  after  the  key  press,  one  sector  of  the  text  became  visible,  replacing  die  dashes  that  had 
nqiresented  ft.  A  sector  of  text  was  normally  a  single  word,  but  two-word  infinitive  phrases  and  those  beginning 
uvidi  an  article  (e.g.,  "the  operculum")  were  presented  as  single  sectors.  Previously  exposed  sectors  remained 
visible  on  the  saeen.  The  reading  time  (time  between  key  presses  in  msec)  on  each  segment  of  the  text  was 
measured.  In  the  practice  trials  only,  after  reading  a  text,  subjects  answered  the  three  multiple  choice 
romprehension  questions  in  sequence  by  typing  in  the  letter  of  the  alphabet  corresponding  to  the  diosen  answer. 
During  the  experimental  portion,  subjects  read  a  block  of  four  texts,  then  they  answered  the  twelve  questions 
relating  to  the  texts  in  the  block. 


Results 


After  analyzing  the  data,  ft  was  found  that  one  subject's  total  of  21  correct  answers  was  more  than  3 
standard  deviations  below  die  overall  mean;  this  subjea  was  omitted  from  all  analyses. 


Reading  Times.  Median  reading  times  in  each  of  the  picture  conditions  for  each  subject  were  computed  for 
he  critical  anaphor  sentences;  the  means  of  these  median  reading  times  are  shown  in  Figure  2  V  The  expected 

iffect  of  distance  was  found  for  the  Full  Picture  and  Disappecring  picture  conditions.  That  is,  subjects  took 
onger  to  read  the  sentence  containing  the  far  anapho*  than  the  sentences  containing  the  near  anaphors.  There  is 
ilso  evidence  of  an  interaction  between  between  picture  condition  and  distance.  Note  however,  that  this 
nteraction  is  the  opposite  of  the  one  predicted  by  the  percqitual  hypothesis  and  by  the  memory  hypothesis.  That 


For  two  reasons,  die  main  analysis  is  on  the  time  to  read  the  anaphor  setttences,  rather  than  the  time  to  read  the 
inaphor  sectors  of  the  sentences.  First,  because  of  a  programming  error,  the  sector  data  were  not  available  for  all 
>f  the  texts.  Second,  because  the  previously  exposed  text  was  available  for  inspection,  some  subjects  might  have 
ilected  to  expose  the  vvhole  anaphor  sentence  (by  multiple  key  presses)  before  reading  the  sentence  carefully.  In 
iny  event,  an  analysis  of  the  times  spent  on  the  anaphor  sectors  was  in  all  essential  respects  like  the  analysis  of  the 
riiole  sentences.  In  addition,  the  mediodology  used  in  Experiment  2  will  allow  an  analysis  of  die  time  to  read  the 
naphor  sectors  as  well  as  the  time  to  read  the  anaphor  sentences. 


Full  Picture 
Dis.  Picture 
No  Picture 
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is,  instead  of  !q;)eeding  reading  by  facilitating  anaphor  resolution  in  the  far  condition,  the  pictures  appear  to  have 
slowed  reading  relative  to  the  No  Picture  omdidon.  These  observations  were  confirmed  by  analyses  of  variance 
(ANOVA)  performed  using  subjects  as  the  random  effect  (statistics  reported  as  "Fi")  and  using  texts  as  the 
random  effect  ("£2").  hi  all  cases,  an  alpha  level  of  .05  has  been  used  for  significance  tests.  The  main  effects  of 
picture,  distance  and  die  picture  by  distance  interacdon  were  all  significant  For  picture,  £1(2,92)  was  3.27, 

=  2036189,  £2(2,46)  =  6.67,  MSE  =  975448;  for  distance,  £1(1,46)  =  10.78,  MSE  =  1875759,  E(l,23) 

=  31.29,  MSE  =  532208;  for  the  interacdon,  £1(2,92)  was  5.98,  MSE  =  1856294,  £2(2,46)  =  7.62,  MSE  = 
820365.  Single  effects  analyses  indicated  that  die  pattern  of  inaeasing  reading  time  widi  increasing  distance  was 
significant  in  both  dis^ipearing  picture  (£1(1,46)  =  7.26,  MSE  =  22S0009,  £2(1,23)  =  15.00,  MSE  =  626394) 
and  full  picture  condidons  (£1(1,46)  =  14.52,  MSE  =  1679978,  £2(1,23)  =  20.42,  MSE  =  958342);  in  the  no 
picture  condidon,  in  contrast  the  single  effect  of  distance  was  not  significant  (£1(1,46)  =  .91,  MSE  =  1628361, 
£2(1,23)  =  .30,  MSE  =  588201). 

Accuracy.  In  a  task  in  vihich  25%  constitutes  chance  perfOTmance,  mean  overall  accuracy  responding 
to  conqirdiension  quesdons  was  81.1%  (see  Figure  3).  Main  effects  were  significant  for  both  picture  (£1^,92) 
=  5.10,  MSE  =  224.23,  £2(2,46)  =  3.32,  MSE  =  166.62)  and  distance  (£1(1,46)  =  13.43,  MSE  =  224.23, 

£2(1 ,23)  =  5.90,  MSE  =  180.72).  Predictably,  accuracy  increased  as  distance  in  text  decreased,  and  as  picture 
availability  increased.  A  central  dieoredcal  prediaion  involved  a  picture-by-distance  interacdon,  but  the  interaction 
did  not  even  approach  significance  (£1(2,92)  =  0.45,  MSE  =  139.05,  £2(2,46)  =  0.43,  MSE  =  65.33). 

Analysis  By  Reading  Speed.  Because  individuals  vary  widely  in  reading  speed  and  reading  strategies,  a 
median  split  was  made  on  the  basis  of  overall  reading  dme,  and  the  data  were  analyzed  using  this  factor.  The  one 
sub;  xt  with  the  median  overall  reading  time  was  onutted  from  the  analysis.  No  difference  was  found  between 
slow  and  fast  readers  on  die  conprehoision  measure  QE1(1,44)  =  .54,  n.s.).  For  the  sentence  reading  times,  the 
three-way  interacdon  of  reading  speed,  distance,  and  picture  was  significant  (£1(2,88)  =  4.50);  simple  effects 
analyses  indicated  the  presence  of  a  distance  by  picture  interacdon  for  slow  readers  (£1(2,88)  =  10.21),  whereas 
this  interaction  was  not  significant  for  fast  readm  (£1(2,88)  =  .05).  Figure  4  shows  the  pattern  of  reading  times 
for  the  slow  readers;  in  this  group,  reading  time  did  not  increase  with  distance  in  the  No  Picture  condidon. 
Inqwrtantly,  data  from  botfi  the  slow  readers  and  the  fast  readers  were  contrary  to  the  predicdons  derived  from  the 
memory  hypothesis  and  die  perceptual  hypothesis. 


Analysis  Bv  Half  of  Experiment  To  test  for  an  interaction  between  dieorctical  factors  and  either  pracdce  or 
fadgue  effects,  performance  during  the  first  and  second  halves  of  the  experiment  was  compared.  For  the  percent 
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ccMTect  measure,  die  second  half  mean  was  higher  than  die  first  half  (81 . 1  vs.  79.7),  but  neither  the  main  effect 
nor  ai^  interaction  was  significaitt  (El  <  1.40,  p  >  .25  in  all  cases).  There  was  a  significant  main  effect  of  half 
on  soitoKe  reading  times  (£1(1,46)  =  15.81,  MSE  =  142271860),  with  subjects  taking  1005  msec  longer  to  read 
critical  sentences  during  die  first  half.  Interactions  involving  half  were  non-significant,  (FI  <  2.19.  p  >  .10  in  all 
cases). 


Discussion 

We  used  distance  as  a  conceptual  maiker  for  anaphor  resolution,  thus  interpretation  of  die  results  requires 
diat  we  dontmstrate  a  significant  distance  effect  Indeed,  the  di^ance  effect  was  significant  for  both  depoident 
variables;  resolution  of  far  anchors  required  more  time  and  produced  less  accurate  ^estion  answering  than 
resohidmi  of  near  amphors.  We  are  not  certain  iKhy  diere  was  no  distance  effect  in  the  reading  times  in  the  no 
picture  condition.  Perhaps  subjects  fmd  die  task  so  difficuk  (in  the  absence  of  pictures)  that  they  read  all  of  the 
sentences  very  slowly.  Given  that  the  distance  manipulation  was  effective,  what  can  we  say  about  the  three 
hypotheses  described  in  the  introduction?  There  was  an  interaction  between  distance  and  picture  condition,  but 
diis  interaction  was  just  the  opposite  of  that  predicted  by  the  percqitual  hypothesis  and  the  memoiy  hypothesis. 
That  is,  when  subjects  had  pictures  available,  they  read  the  far  anaphor  sentence  slower  than  the  near  anafhor 
sentence.  The  interaction  we  obtained  provides  no  support  for  die  two  hypothesis,  but  neidier  is  the  interaction 
fatal  As  mentioned  in  die  introduction,  die  picture  may  induce  reading  strat^ies  that  would  countermand  the 
predictions  r^arding  reading  times.  Because  of  this  possibility,  we  included  the  conqirdiension  questions. 
Unfortunately  for  the  perceptual  and  memory  hypotheses,  the  comprehension  question  data  do  appear  to  be  fatal. 
There  was  a  distance  effect,  but  litde  or  no  evidence  for  an  interaction  of  distance  and  picture  condition.  Thus  the 
wdght  of  the  evidence  is  that  pictures  do  not  affect  angora  resolution. 

Experiment  2 

There  are  several  reasons  for  being  suspicious  about  the  conclusion  that  pictures  do  not  affect  anaphor 
resolution.  First,  it  dqiends  on  accqiting  the  null  hypothesis  (we  will  presm  a  power  analysis  in  the  Graeral 
Discussion).  Second,  it  rests  in  part  on  the  conprefa^ion  question  data,  which  is  an  off-line  measure,  whereas 
anaihOT  resolution  is  an  on-line  {xocess.  Third,  it  may  be  peculiar  to  die  experimental  context  That  is,  subjects 
may  have  become  sensitive  to  the  requironents  of  die  comprehension  questions  and  chosen  to  read  excptionally 
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carefully,  and  periu^s  reread  die  text,  ixiiedier  or  not  the  picture  was  available.  This  strata  may  have  been 
mcouraged  by  the  continuous  availability  of  the  text  Experiment  2  was  conceptually  similar  to  Experiment  1 , 
excqit  for  two  changes.  First  a  moving  window  methodology  (Just  Carpwiter,  and  Wooley,  1982)  was  used. 
Each  key  press  exposed  a  single  word  (or  short  phrase)  and  erased  the  previously  eiqposed  text  This 
methodology  eliminates  the  opportunity  to  reread  the  text  and  thus  should  enhance  the  inteqiretability  of  die 
reading  time  data.  Second,  we  drcqiped  the  dis^pearing  picture  condition  for  two  reasons.  It  did  not  add  mudi 
useful  information  in  Experiment  1,  and  because  subjects  knew  that  pictures  would  disappear  frequottly,  the 
subjects  may  have  adopted  unusual  reading  or  picture  inspection  strat^es. 

Mssbod 

Subjects.  Ihirty-six  undergraduate  students  at  the  University  of  Wisconsin-Madison  contributed  Haia  As 
before,  they  received  both  course  credit  for  participating  and  a  cash  bonus  based  on  die  accuracy  of  thdr 
responses  to  (piestion.  Three  subjects  were  replaced,  one  for  failing  to  achieve  above  chance  performance  on  the 
(piestions,  and  two  because  of  computer  failures. 

Materials  and  design.  Texts  and  diagrams  were  the  same  as  those  used  in  Experiment  1,  but  one  practice 
text  and  four  distractor  texts  were  omitted,  leaving  a  total  of  2  practice  and  28  experimental  texts,  24  of  wliich 
were  scored,  for  each  subject  The  one  change  firom  Experiment  1  in  the  presentation  of  texts  and  diagrams  was 
that  a  moving  window,  as  described  by  Just,  Carpenter,  and  Wooley  (1982),  was  employed.  The  initial  display 
consisted  of  dashes  representing  all  non-space  characters  in  the  text  Upon  a  key  press  by  the  reader,  one  sector 
of  the  text  appeared  to  rqilace  the  dashes  corresponding  to  it  Chi  the  next  k^  press,  the  previous  sector  was 
replaced  again  by  dashes,  and  one  new  sectcx  was  revealed.  Thus,  the  reading  times  measured  represent  the  time 
during  which  the  words  in  a  s^ment  of  text  were  available  for  reading. 

Picture  availability  and  distance  were  again  manipulated  widiin  subjects.  Two  levels  of  die  picture  variable 
were  used:  No  Pictue,  and  Ftill  Picture.  Therefore,  each  subject  contributed  12  Near  and  12  Far  observations  in 
tiodi  of  the  two  picture  conditions. 

Results 

Reading  Times.  Because  reading  time  for  an  individual  word  is  meaningful  in  the  moving  window 
presentation,  analyses  were  performed  on  reading  times  for  the  anaphor  phrases  thonselves  as  well  as  fm*  die  foil 
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sentences  containing  them.  There  were  no  significant  effects  on  the  phrase  reading  times,  Es  <  2.4,  p  >  .10  for 
all  analyses  perftxmed.  Reading  times  for  the  foil  sentences  are  shown  in  Figure  S.  Hie  pattern  of  results  is  very 
similar  to  diat  found  in  E^ierimait  1.  There  were  significant  effects  of  picture  (FI  f  1.351  =  19.49,  MSE  = 
595377,  E(1^3)  =  12.83,  MSE  =  503150),  and  picture  by  distance  interaction  fFUl.351  =  5.28,  MSE  = 
590623,  £2(1,23)  =  6.13,  MSE  =  257488).  The  effect  of  distance  qiproadied  significance  for  subjects, 

El(l,35)  =  3.54,  MSE  =  484865, .  10  >  p  >  .05,  but  not  for  t  'ts,  E2£1.23)  =  1.30,  MSE  =  389154.  The  same 
pattern  of  simple  effects  was  found  as  in  Experiment  1,  with  distance  having  a  significant  effect  in  die  foil  picture 
condition  (£1(1,35)  =  5.51,  MSE  =  858047,  E(l,23)  =  4.41,  MSE  =  439072),  but  a  non-significant  effect  in 
die  no  picture  condition  ( £1(1,35)  =  .48,  MSE  =  217441,  £2(1,23)  =  .71,  MSE  =  207570).  Note  that  this 
pattern  is  oontnuy  to  die  predictions  derived  from  memory  and  die  perceptual  hypotheses. 

Accuracy.  Percent  correct  responses  to  comprdiension  questions  are  shown  in  Figure  6.  As  in  Experiment 
1,  main  effects  of  picture  (£1(1,35)  =  89.7,  MSE  =  136.10,  £2(1,23)  =  26.03,  MSE  =  312.53)  and  distance 
(£1(1,35)  =  25.66,  MSE  =  74.73,  E(l,23)  =  12.71,  MSE  =  100.46)  were  both  significant  The  picture-by¬ 
distance  interaction  was  not  significant  both  £s  <  1 .  Failure  to  fold  this  interaction  is  contrary  to  both  the 
memory  and  the  percqitual  hypodieses. 

Analysis  By  Reading  Speed.  In  Expoiment  2,  slow  readers  performed  significantly  better  on  the 
comprehension  measure  than  did  fast  readers  (80.7%  vs.  65.5%,  £1(1,34)  =  1 1.57).  For  the  sentence  reading 
times,  the  three-way  interaction  of  reading  speed,  distance,  and  picture  was  again  significant  (£1(1,34)  =  5.89); 
fOT  this  experimoit,  the  reading  speed  by  distance  interaction  was  also  significant  (£1(1,34)  =12.65),  and  again 
simple  effects  analyses  indicated  the  presence  of  a  distance  by  picture  interaction  for  slow  readers  (£1(1,34)  = 
14.21),  but  not  for  fast  reado^  (El(2,88)  =  .11).  As  Figure  7  indicates,  the  basis  of  the  interaction  is  that  reading 
time  increased  with  greater  distance,  but  only  for  the  slow  reader  group,  and  only  wiien  a  picture  was  available. 
Analyzing  the  phrase  time  measure  indicated  that  no  interactions  involving  reading  ^leed  and  other  variables  were 
significant,  £l's  <  2.70,  p  <  .10. 

Analysis  By  Half  of  Experiment  The  same  pattern  of  results  was  obtained  as  in  Experiment  1.  Second  half 
comprehoision  exceeded  first  half  by  3.7%,  but  main  effects  and  interactions  of  half  on  this  dependent  measure 
were  nm  significant  ( £1  <  2.71,  p  >  .10  for  all).  Reading  times  for  critical  sentences  were  877  msec  longer  in 
the  first  half,  and  this  differoice  was  significant  ( £1(1,35)  =  24.88,  MSE  =  2230202).  Interactions  between  half 
and  (Mher  factors  were  non-significant,  with  £ls  <  1. 
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PiscusaoD 

Althou^  the  mediodology  used  in  Experiment  2  differs  from  that  in  Experiment  1  in  several  inportant 
re^>ects,  die  results  of  die  two  experimoits  are  quite  similar.  In  particular,  fcx*  the  reading  times,  there  was  a 
significant  interaction  between  distance  and  picture  condition,  but  the  interaction  was  just  die  opposite  of  that 
expected  by  the  monory  and  percqmial  hypodieses.  Coifrbined  with  the  failure  to  find  an  interaction  in  the 
conqvehension  data,  there  are  strong  grounds  for  rejecting  the  hypotheses. 

It  is  not  surinising  diat  the  reading  time  measure  revealed  an  effect  of  distance  only  for  the  full  anaphor 
sentences,  radier  than  on  the  reading  time  for  the  amqdKx-  itself.  Ehrlich  and  Rayner  (1983)  demonstrated  just 
such  an  effect  using  durations  of  eye  fixatitms  as  the  dependmt  vari^le.  They  concluded  that  only  lexical  access 
and  syntactic  parsing  are  carried  out  immediately  upon  fixation  of  a  word;  other  imcesses,  such  as  assigning  a 
referent,  are  completed  later.  Van  den  Broek  and  Thurlow  (1990)  repotted  an  analogous  finding  whoi  measuring 
the  time  to  make  causal  inferences.  That  is,  die  increase  in  reading  time  (with  difficulty  of  the  inference)  occurred 
after  the  aitical  sentence  demanding  die  inference,  not  while  reading  that  sentence. 

Gcpcrat  Piacusaion 

In  this  discussion  we  will  consido'  two  issues,  possible  problems  with  our  conclusion  diat  pictures  do  not 
facilitate  anaphcH*  resolution,  and  what  accounts  for  die  large  facilitation  in  comprehension  due  to  pictures. 

There  are  several  w^s  in  which  our  results  are  limited,  but  not  critically  so.  First,  we  used  only  two  values 
of  distance,  near  and  far.  Periuqis  the  values  we  choose  are  not  optimum  for  demonstrating  effects  consistent 
with  die  memory  and  perceptual  hypotheses.  This  possibility  can  be  dismissed  readily.  We  did  obtain  distance 
effects  in  both  reading  time  and  in  a  measure  of  comprehension.  The  problem  was  in  failing  to  fuid  the  predicted 
interactions.  Also,  Ehrlich  and  Rayner  manipulated  distance  over  three  values,  near,  intermediate  (roughly 
cmresponding  to  our  near),  and  far.  The  largest  difference  that  they  observed  were  between  die  intermediate  (our 
near)  and  the  far  conditicMis. 

A  second  limitation  is  that  we  used  only  one  type  of  anaphora,  referring  to  a  part  by  its  spatial  location. 

This  is  somewhat  unusual  in  the  literature,  but  it  is  just  the  sort  of  anaphor  that  should  have  revealed  a  benefit 
from  a  spatially  organized  picture.  Nonetheless,  no  benefit  (in  anafdior  resolution)  was  found.  We  used  a 
different  sort  of  anaphor,  exact  repetitkxi  of  a  part's  name,  in  several  pilot  experiments  (some  in  which  whole 


Annual  Technical  Rqxxt  Page  23  AFOSR-89-0367 

7/1/90  through  6/3(V91 

sentences  were  exposed  with  each  key  press  and  some  using  the  moving  window  technique).  None  of  those 
experin^nts  revealed  any  evidoice  in  su|^[X)rt  of  the  memory  or  perc^mial  hypotheses. 

Third,  one  could  argue  that  repetition  of  a  part's  spatial  location  can  be  an  inelegant  form  of  reference.  That 
is,  a  concept  is  for^rounded  (die  curroit  topic),  it  is  easier  to  refer  to  the  concept  using  a  pronoun  than  a 
noun  phrase;  foil  noun  phrases  seem  to  be  required  mily  when  die  concept  is  not  the  current  tcpic  (see  Gairod  and 
SanfOTd,  1990,  and  (jordon,  Grosz,  Gertner,  and  Rabin,  1990,  as  exanqiles).  Whereas  diis  criticism  is  well- 
founded,  it  predicts  just  the  c^iposite  of  what  we  found.  That  is,  if  die  noun  phrase  is  an  inel^ant  form  of 
reference  in  die  near  condition  (because  the  antecedent  is  the  subject  of  the  previous  sentence  so  may  well  be 
for^rounded),  dien  reading  times  should  be  slower  in  the  near  condition  than  in  the  far  condition.  A  glance  at 
either  Figures  2  or  5  will  confirm  that  this  did  not  occur. 

Finally,  our  conclusion  rests  on  accepting  the  null  hypothesis  of  no  interaction  between  picture  condition 
and  distance,  and  so  the  question  arises  as  to  whether  or  not  the  experiments  had  sufficient  power  to  detect  the 
interaction.  In  Experimoit  1,  there  was  sufficient  power  to  have  a  0.5  chance  of  detecting  an  interaction  effect 
(gamma)  as  small  as  .284.  In  Experiment  2,  an  effect  size  of  .325  could  have  been  detected  with  a  probability  of 
0.5.  Considering  the  two  experiments  as  indepoident  replications,  there  was  a  greater  than  0.75  probability  of 
detecting  an  interaction  effect  as  small  as  .325.  Thus,  it  seems  safe  to  conclude  that  the  interaction  is  either  absent 
or  small. 

There  is  no  doubt  that  pictures  focilitated  con^ehension.  The  picture  effect  was  significant  in  both 
experiments  (by  subjects  and  by  texts),  and  in  Experiment  2  it  was  quite  large;  subjects  averaged  64%  correa  in 
the  no  picture  condition  and  82%  correct  in  die  picture  conditioa  If  pictures  are  not  helping  to  resolve  anaphors, 
what  is  bdiind  tiiis  facilitation?  Clearly,  diere  is  little  in  the  present  data  that  will  help  to  specify  a  single  answer, 
but  the  data  are  consistent  with  several  possibilities,  and  inconsistent  with  at  least  one  other.  The  one  unlikdy 
answer  is  that  having  a  picture  acts  to  inaease  study  time:  tiiat  is,  die  picture  slows  down  reading  and  therd)y 
imoduces  better  comprehension.  Although  the  data  are  roughly  consistent  with  diis  hypothesis  (longer  reading 
times  and  better  comprehension  with  pictures  than  without),  a  finer-grained  analysis  reveals  trouble.  For 
example,  in  Experiment  2  in  the  near  condition,  there  is  little  difference  in  reading  time  for  foil  pictures  and  for  no 
pictures,  however  there  is  a  large  difference  in  the  comprehension  scores.  Also,  in  both  experiments,  subjects 
ttxdt  longer  to  read  the  far  sentence,  but  did  more  poorly  on  the  far  conprehension  question. 


One  hypothesis  that  is  consistent  with  the  data  is  dual  code  dieory  (Paivio,  1986).  In  outline,  processing  the 
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picture  results  in  a  Icxig-term  spatial  representation,  wliereas  processing  the  text  produces  a  long-term  verbal 
represratatioa  The  two  refveseitadons  are  different  in  kind,  but  there  may  be  associations  relating  concepts  with 
the  same  referent  On  this  account,  diere  is  no  reason  to  suspect  that  pictures  would  help  in  the  on-line 
resolution  of  anaphora  because  Aat  is  a  ved^al/linguistic  task  carried  out  separately  from  pictorial  processing. 
Nonedieless,  the  pictorial  representation  may  be  useful  when  answering  the  comprehension  questions.  When  the 
encoded  verbal  informiaion  is  insufficiott  to  provide  an  answer,  the  subject  could  consult  the  pictorial 
representation  for  the  cexrect  answer.  There  is  no  reason  to  suspect  that  the  quality  of  the  pictorial  represratation 
would  vary  widt  die  linguistic  variable  of  distance,  hence,  die  independence  of  distance  and  picture  conditions  is 
consistent  widi  this  approach.  Note  that  this  hypothesis  does  not  exploit  the  concept  of  working  memory 
managemok. 

A  second  possibility  is  that  the  pictures  and  the  text  interact  in  creating  a  more  effective  representation  than 
diat  based  on  eidier  modality  alone.  For  example,  the  picture  may  help  to  disambiguate  the  text  and  thereby 
cemtribute  to  the  encoding  of  extra-textual  relationships.  For  example,  although  the  text  in  Table  1  gives  the 
relative  spatial  positions  of  die  operculum,  the  columella,  and  the  pseudopodhim,  the  text  does  not  qiecify  diat  the 
three  structures  are  contiguous.  Knowing  that  die  structures  are  contiguous  should  facilitate  the  mcoding  of 
spatial  and  other  relationships.  That  is,  knowing  that  the  operculum  maintains  internal  pressure  directly  qq  the 
columella,  presumably  makes  it  easier  to  understand  the  relationship  between  the  internal  pressure  and  the 
popping  and  release  of  the  spores.  This  second  possibility  was  supported  by  Glenbetg  and  Langston  (in  press). 
Their  data  indicated  that  a  picture  could  help  to  emphasize  particular  relationships. 

This  second  possibility  that  pictures  and  text  interact  is  consistent  widi  the  concept  of  working  memoy 
manag^nent  in  two  ways.  First,  because  the  picture  specifies  the  exact  spatial  relations  of  the  parts,  these 
rdittionships  can  be  represented  (at  least  initially)  by  the  ^atial  component  of  working  memory.  Second,  it  is 
probabfy  easier  to  maintain  spatial  rdationsh^s  in  working  memory  given  the  direct  perceptual  support  of  the 
picture  than  if  die  spatial  relationshqis  had  to  be  maintained  in  the  absence  of  perceptual  support  These 
suggestions  remain  to  be  empirically  tested. 
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